DRAFT: Scalability Bugs: When 100-Node Testing is Not Enough

Authors

Tanakorn Leesatapornwongsa

Cesar A. Stuardo

Riza O. Suminto

Huan Ke

Jeffrey F. Lukman

Haryadi S. Gunawi

Abstract

We highlight the problem of scalability bugs, a new class of bugs that appear in “cloud-scale” distributed systems. Scalability bugs are latent bugs that are cluster-scale dependent: their symptoms typically surface in large-scale deployments, but not in small- or medium-scale deployments. The standard practice for testing large distributed systems is to deploy them on a large number of machines (“real-scale testing”), which is difficult and expensive. New methods are needed to reduce developers’ burden in finding, reproducing, and debugging scalability bugs. We propose “scale check,” an approach that helps developers find and replay scalability bugs at real scales on a single machine while still achieving high accuracy (i.e., observed behaviors similar to those of nodes deployed in real-scale testing).


Similar resources

Techniques for Identifying Elusive Corner-Case Bugs in Systems Software

Modern software is plagued by elusive corner-case bugs (e.g., security vulnerabilities). There are no scalable, automated ways of finding them, therefore such bugs can remain hidden until software is deployed in production. This thesis proposes approaches to solve this problem. First, we present black-box and white-box fault injection mechanisms, which allow developers to test the behavior of t...


Enhancing Automated Fault Discovery and Analysis

By Jared David DeMott. Creating quality software is difficult. Likewise, offensive researchers look to penetrate quality software. Both parties benefit from a scalable bug hunting framework. Once bugs are found, an equally expensive task is debugging. To debug faults, analysts must identify statements involved in the failures and select suspicious...


ABHRANTA: Locating Bugs that Manifest at Large System Scales

A key challenge in developing large-scale applications (both in system size and in input size) is finding bugs that are latent at the small scales of testing, only manifesting when a program is deployed at large scales. Traditional statistical techniques fail because no error-free run is available at deployment scales for training purposes. Prior work used scaling models to detect anomalous beh...


TAC: A Topology-Aware Chord-based Peer-to-Peer Network

Among structured Peer-to-Peer systems, Chord is widely popular due to salient features such as simplicity, high scalability, small path length with respect to network size, and flexibility on node join and departure. However, Chord doesn’t take into account the topology of the underlying physical network when a new node is being added to the system, thus resulting in high routing late...


Understanding, Detecting, and Repairing Performance Bugs

Software performance is critical for how end-users perceive the quality of software products. Performance bugs—programming errors that cause performance degradation—lead to poor user experience and low system throughput. Despite advances in profiling techniques, performance bugs still escape in production runs. There are two key reasons why performance bugs are not effectively detected during i...




Publication date: 2017
